Scheduling computations with provably low synchronization overheads
نویسندگان
چکیده
We present a Work Stealing scheduling algorithm that provably avoids most synchronization overheads by keeping processors’ deques entirely private default and only exposing work when requested thieves. This is the first paper obtains bounds on are (essentially) independent of total amount work, thus corresponding to great improvement, in both design theory, over state-of-the-art algorithms. Consider any computation with $$T_{1}$$ critical-path length $$T_{\infty }$$ executed P processors using our scheduler. Our analysis shows expected execution time $$O\left( \frac{T_{1}}{P} + T_{\infty }\right) $$ , incurred during at \left( C_{\mathrm{CAS}} C_{\mathrm{MFence}}\right) PT_{\infty where $$C_{\mathrm{CAS}}$$ $$C_{\mathrm{MFence}}$$ respectively, denote maximum cost executing Compare-And-Swap instruction Memory Fence instruction.
منابع مشابه
Measuring Synchronisation and Scheduling Overheads in OpenMP
| Overheads due to synchronisation and loop scheduling are an important factor in determining the performance of shared memory parallel programs. We present set of benchmarks to measure these classes of overhead for language constructs in OpenMP. Results are presented for three diierent hardware platforms, each with its own implementation of OpenMP. Signiicant diierences are observed, which sug...
متن کاملSimulation of Real-time Multiprocessor Scheduling with Overheads
Numerous scheduling algorithms were and still are designed in order to handle multiprocessor architectures, raising new issues due to the complexity of such architectures. Moreover, evaluating them is difficult without a real and complex implementation. Thus, this paper presents a tool that intends to facilitate the study of schedulers by providing an easy way of prototyping. Compared to the ot...
متن کاملReducing Scheduling Overheads for Concurrent Logic Programs
Strictness analysis is crucial for the efficient implementation of the lazy flmctionM languages. A related technique for the concurrent logic languages (CLLs) called schedule analysis is presented which divides at compile-time a CLL program into threads of totally ordered atoms, whose relative ordering is determined at run-time. The technique enables the enqueuing and dequeuing of processes to ...
متن کاملFIFO with Offsets: High Schedulability with Low Overheads
The OS scheduler’s memory and runtime overheads form crucial design constraints for embedded systems implemented on low-cost hardware platforms. Table-driven scheduling can provide a high level of schedulability; however, it also consumes significant amounts of memory. By contrast, effective non-preemptive scheduling policies, such as the non-workconserving Critical-Window EDF (CW-EDF), have lo...
متن کاملScheduling Imprecise Computations with Wind-up Parts
The imprecise computation model provides the ability to cope with unpredictable workloads. However, there is no consistent way on how to terminate the computation in its early stage. This paper describes a novel approach for safely terminating imprecise computations. First, a new logical part called wind-up part is added to the imprecise computation model. This wind-up part is used by applicati...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Scheduling
سال: 2021
ISSN: ['1099-1425', '1094-6136']
DOI: https://doi.org/10.1007/s10951-021-00706-6